PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Csa20g071210.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HB-other
Protein Properties Length: 1703aa    MW: 191379 Da    PI: 5.0943
Description HB-other family protein
Gene Model
Gene Model ID     Type      Source    Coding Sequence
Csa20g071210.1    genome    CSGP      View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      Homeobox    56.4     5e-18      24       80     1            57
                    TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
        Homeobox  1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57
                    ++kR   t+ qle+Le+ ++ ++yps++ r++L+ kl+L++rq ++WF+ rR k++k
  Csa20g071210.1 24 KSKRKMKTAAQLEVLETTYAAEPYPSEAIRADLSVKLNLSDRQLQMWFCHRRLKDRK 80
                    679****************************************************97 PP

Protein Features
3D Structure
Database           Entry ID            E-value     Start    End     InterPro ID    Description
Gene3D             G3DSA:1.10.10.60    5.2E-17     7        80      IPR009057      Homeodomain-like
SuperFamily        SSF46689            7.27E-15    17       80      IPR009057      Homeodomain-like
PROSITE profile    PS50071             16.244      21       81      IPR001356      Homeobox domain
SMART              SM00389             9.7E-16     23       85      IPR001356      Homeobox domain
CDD                cd00086             7.78E-13    24       81      No hit         No description
Pfam               PF00046             1.2E-15     24       80      IPR001356      Homeobox domain
SMART              SM00571             2.0E-22     534      593     IPR018501      DDT domain
PROSITE profile    PS50827             15.784      534      593     IPR018501      DDT domain
Pfam               PF02791             1.7E-16     535      590     IPR018501      DDT domain
Pfam               PF05066             1.1E-15     716      784     IPR007759      HB1/Asxl, restriction endonuclease HTH domain
Pfam               PF15612             1.6E-6      912      953     IPR028942      WHIM1 domain
Pfam               PF15613             1.0E-12     1085     1157    IPR028941      WHIM2 domain
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
GO:0003677    Molecular Function    DNA binding
Sequence
Protein Sequence    Length: 1703 aa
MEGGSDEATE KNTKTPPEEG GESKSKRKMK TAAQLEVLET TYAAEPYPSE AIRADLSVKL  60
NLSDRQLQMW FCHRRLKDRK STTTTPSSKR QRKELITTPM AVESPKPAVN AADLVAGNEF  120
DSRRAARVSG SGSGSGGVTV VRRFNEPSSA EVRAVGYMEA QLGERLRDNG PILGMEFDPL  180
PPGAFGMPIE MPSHRKATRQ PFETSLYVRS DVKPSKDVRP IREYQFLPDL PSSRTDHSER  240
VSPSHHYGVP LDASVMRATT VSAGHRDGYK VSPQIPNLNL ATHQGKPGHV YSPNLPEYDS  300
PYQKSYMDTP AQRNLNDHPI HEDPFLKSER EVGNEDEDDD ALQLERKRKN EEARISRELE  360
AHEKRIRREL EKQDMLRRKR EEQLKKEMER QYRERRKEEE RLLRERQREE ERFMKEQMRE  420
LQRREKFLKK ETMRAEKMRQ KEEMRREKEV ARLKAANERA IARKIAKESM ELIEDERLEL  480
MEVAALTKGL PSMLALDFET LQNLDEYRDK QALFPPTSVK LKKPFTVKPW NGSDENVANL  540
LMVWRFLITF ADVFGLWPFT LDEFAQAFHD YDPRLMGEIH IVLLKTIIKD IEGVTRTLLT  600
GVGANQNAAS NPGGGHPHVV EGAYAWGFDI RSWRRNLNVF TWPEILRQLA LSAGLGPQLK  660
KRNIKTVSVH DENEANNSEN VIFNLRKGVA AENAFAKMQE RGLSNPKRSR HRLTPGTVKF  720
AAFHVLSLEG EKGLTILDVA EKIQKSGLRD LTTSRTPEAS VAAALSRDTK LFERVAPSTY  780
CVRASYRKEA GGAETILAEA RERIRAFKSG ITDVEDVDDA EREEDSESDV GDDPEIDLNP  840
KKEDPDALDI ENSVKVEPVL ENGKTKAGLP LTPSLPEDIK DEKRDDILVD QSLEDAVAND  900
ADSACFDESK LGEQWVQGLV EGDYSNLSSE ERLNALVALI GIAIEGNTIR VALEERLEVA  960
SSLKKQMWSE VQLDKRWKEE SLLRANYLSY PTAKPGLNIA TPASGNQEIS SADVTPISSQ  1020
DPLSRPQMDV NNVIAGPSLQ LQENVSGMEN LQYQQQGGYT ADRERLRAQL KAYVGYKAEE  1080
LYVYRSLPLG QDRRRNRYWR FSASASRNDP GCGRIFVELQ DGRWRLIDSE EGFDYLVKSL  1140
DVRGVRESHL HFMLLKIEAS FKESVRKNLE ATPGGLCSIS SSLDSDTAEI STTFKIELGD  1200
SNAIERDSVL QRFQSFEKWM WDNMLHPGAL SALKYGAKQS SPLFRICRTC AGLHFVEDIC  1260
CPSCGQMHAS PDVGELCFAE QVAQLGDNSR GGDTGFILRS SISSPLRIRL LKIQLALVEA  1320
SLPPEGLEAF WTEKLRKSWG LKLLSSSSPE ELNQVLTTLE VALKRDFLSS NFETTSELLG  1380
LPEEALPSDL TCMVNVLPWI PKTTGGVALR LFEFDSSIVY TPDQNNDPLK DKESEDLMGL  1440
ETNLLRNVPE KDVMETPVQG GYMQEENWTD PGLGGVSSSG RGGRPPRGRG RPRSRGSGGN  1500
GKKPAVSSSR PPRGAANTNG ETMLRPRAQP RGKKNGRRSS TKGRKRPTKG TLGISNEVVG  1560
GRLSKEVAVT AKTTLPDNED DWIETPELQD DDGEASSSGR SFQYKDYDDD EVMAPMDDFD  1620
DVGESSKLVG RGEFSLHSDD EYEEEDEEEE EEEDMNTKMD VDYINDDSFG RREQPEISND  1680
TARKRFMFDD PDLTSSSSSD YR*
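
The Start and End coordinates in the Signature Domain table can be checked against the numbered sequence listing. The following is a minimal sketch, not part of PlantTFDB: the helper names parse_numbered_block and slice_1based are hypothetical, and it assumes the domain coordinates are 1-based and inclusive. Only the first two lines of the listing are used to keep the example short.

```python
import re


def parse_numbered_block(text: str) -> str:
    """Join a blocked, position-numbered protein listing into one string.

    Hypothetical helper: drops spaces, newlines and the running position
    numbers, then removes a trailing '*' stop marker if present.
    """
    residues = re.sub(r"[^A-Za-z*]", "", text)
    return residues.rstrip("*").upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


# First two lines of the protein sequence listing for Csa20g071210.1.
first_two_lines = """\
MEGGSDEATE KNTKTPPEEG GESKSKRKMK TAAQLEVLET TYAAEPYPSE AIRADLSVKL  60
NLSDRQLQMW FCHRRLKDRK STTTTPSSKR QRKELITTPM AVESPKPAVN AADLVAGNEF  120
"""

seq = parse_numbered_block(first_two_lines)

# Homeobox signature domain: Start 24, End 80 in the Signature Domain table.
homeobox = slice_1based(seq, 24, 80)
print(len(seq), len(homeobox))
print(homeobox)
```

Under that assumption, the extracted segment is 57 residues long and reproduces the Csa20g071210.1 row of the Homeobox alignment (positions 24 to 80).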
Nuclear Localization Signal
NLS
No.    Start    End     Sequence
1      1532     1546    KKNGRRSSTKGRKRP
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    AY042893    0.0        AY042893.1 Arabidopsis thaliana Unknown protein (MLN1.10) mRNA, complete cds.
Annotation -- Protein
Source       Hit ID            E-value    Description
Refseq       XP_010494421.1    0.0        PREDICTED: uncharacterized protein LOC104771575
Swissprot    Q9FFH1            0.0        RLT2_ARATH; Homeobox-DDT domain protein RLT2
TrEMBL       R0EUK2            0.0        R0EUK2_9BRAS; Uncharacterized protein
STRING       AT5G44180.1       0.0        (Arabidopsis thaliana)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
MalvidsOGEM96012530
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT5G44180.1    0.0        Homeodomain-like transcriptional regulator